Faster cover trees
نویسندگان
چکیده
The cover tree data structure speeds up exact nearest neighbor queries over arbitrary metric spaces (Beygelzimer et al., 2006). This paper makes cover trees even faster. In particular, we provide 1. A simpler definition of the cover tree that reduces the number of nodes from O(n) to exactly n, 2. An additional invariant that makes queries faster in practice, 3. Algorithms for constructing and querying the tree in parallel on multiprocessor systems, and 4. A more cache efficient memory layout. On standard benchmark datasets, we reduce the number of distance computations by 10–50%. On a large-scale bioinformatics dataset, we reduce the number of distance computations by 71%. On a large-scale image dataset, our parallel algorithm with 16 cores reduces tree construction time from 3.5 hours to 12 minutes.
منابع مشابه
Investigation on the Amount of Mortality of Iranian Oak Trees (Quercus brantii Lindi) using Satellite Imagery (Case study: Dashtebarm forests of Fars Province)
In recent years, oak trees in the Zagros forests have suffered a lot of fatalities for unclear reasons. Determining the rate and severity of forest crown density changes is important for the investigation and management of these forests. This research was carried out with the aim of determining the amount of crown cover changes in the forest area of Dashtebarm in Fars province under the influen...
متن کاملUsing Combinatorics to Prune Search Trees: Independent and Dominating Set
We introduce a surprisingly simple technique to design and analyze algorithms based on search trees, that significantly improves many existing results in the area of exact algorithms. The technique is based on measuring the progress of Branch & Bound algorithms by making use of a combinatorial relation between the average and maximum dual degrees of a graph. By dual degree of a vertex, we mean ...
متن کاملAutomated mapping of montane snow cover at subpixel resolution from the Landsat Thematic Mapper
A fully automated method uses Landsat Thematic Mapper data to map snow cover in the Sierra Nevada and make quantitative estimates of the fractional snow-covered area within each pixel. We model winter and spring reference scenes as linear mixtures of image end member spectra to produce the response variables for tree-based regression and classification models. Decision trees identify cloud cove...
متن کاملPosttreatment tree mortality after forest ecological restoration, Arizona, United States.
Pine-oak forests are of high ecological importance worldwide, but many are threatened by uncharacteristically severe wildfire. Forest restoration treatments, including the reintroduction of a surface fire regime, are intended to decrease fire hazard and emulate historic ecosystem structure and function. Restoration has recently received much management attention and short-term study, but little...
متن کاملStudy on the effect of forest stand distribution pattern on results of different estimators of the nearest individual distance method
The Nearest Individual Sampling Method is one of the distance sampling methods for estimating density, canopy cover and height of forest stands. Some distance sampling methods have more than one density estimator that may be skewed to the spatial pattern. Unless the stands of the trees under study have a random spatial pattern. Therefore, the purpose of this study was evaluating the effect of s...
متن کامل